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Abstract 

A syntactical proof is given that all functions definable in a certain affine linear 
typed A-calculus with iteration in all types are polynomial time computable. The 
proof provides explicit polynomial bounds that can easily be calculated. 



1 Introduction 

Recent research [|], Q ^ has provided many characterizations of Ptime (the class 
of all functions computable in polynomial time) by means of appropriate restrictions 
of the terms in Godel's T [Q]. Consider the following definition of an exponentially 
growing function. 

double([]) [] exp([]) :=[tt] 

dotible([a | I]) [a, a | dotible(^)] exp([a | I]) double(exp(^)) 

Approaches based on predicative recursion [|l9[ | , 16 1 argue that the exponential growth 
in this example is due to the way double is called: the previous value exp(£) of the 
outer recursion is the recursive argument to the inner recursion. 

Although such approaches can capture all functions computable in polynomial time, 
many natural algorithms are excluded, particularly if they involve nested recursions. 
Standard examples are sorting algorithms like insertion sort, which has a similar recur- 
sive structure as exp: 

insert(a, [a] 

insert(a, [6 \ t]) := [h' \ insert(a', i?) ] a', 6' a permutation of a, 6 with 6' < a' 

sort([]):=[] 
sort([a I i]) :~ insert(a, sort(^)) 



* supported by the "GraduiertenkoUeg Logik in der Informatik" of the "Deutsche Forschungsgemein- 
schaft" 
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Caseiro |g] studied many related examples and reached some (partially semantic) cri- 
teria for algorithms in order to ensure polynomial time complexity. For the insertion 
sort algorithm the essential point is that insert does not increase the size of its input. 
Hofmann [|l^ took up this line of research and formulated a new term system with a 
special type O of tokens that accommodates nested recursion, but only allows to define 
non-size-increasing Ptime functions. The basic idea is that if a function increases the 
size of its argument (like a successor function), then one has to "pay" for the increase 
by providing a "token", i.e., a term of type O. 

Hofmann [ [To| ] proved this result by means of inherently semantic concepts, such as 
the set-theoretic interpretation of terms. We present a new proof of his main result, 
which apart from being simpler provides more insight and yields an explicit construc- 
tion of time bound polynomials. Although Hofmann's proof is constructive and thus 
also contains bounds, these are deeply hidden. 

The method developed here has several benefits: 

• A reduction relation is defined in such a way that the term system is closed under 
it. Therefore calculations can be performed within the system. 

• We not only show that every definable function is polytime, but give explicit 
polynomial bounds for the number of reduction steps that can be determined 
easily for any given term. 

• Hofmann's semantical size measure §3.2] (minimal upper bound of growth) 
is replaced by the syntactic concept of the number of free variables. Hence the 



role of the O-type becomes more transparent, as we will show (in Lemma 4.3) 
that there are no closed terms of this type. 

A preliminary version [|l]] of this work has already been published. Apart from giving 
more technical details and elaborated proofs, the following aspects are added: 

• The previous estimate of the number of reduction steps referred to a fixed re- 
duction strategy. Here we show that this requirement can be relaxed somewhat, 
without loosing the sharp estimate on the length of reduction sequences. The 
main tool is an appropriate modification of the size measure d {■). 

• It is shown how the approach covers more complex data structures such as binary 
labeled trees, for which iteration involves two recursive calls. 



Hofmann's paper [ ]l0| ] is of course the starting point of ours. In the journal version Jill ] 
some new aspects are added, among others the definition and justification of a similar 
system which captures P space, and an operator for divide-and-conquer recursion. We 



shall show in Section 5.2 that our simplified approach can deal with the latter; however. 



it is not known whether it also suffices for Hofmann's characterization of P space. 



Jones [|15p, in a paper called "The expressive power of higher-order types or, life with- 
out CONS", directly relates programming languages and complexity theory. In a pre- 
vious paper [|l4|] (see also [|l3]l) he had characterized the power of first-order read-only 
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programs in complexity terms; the results are extended [|l5]] to arbitrary data orders and 
tail recursive programs. As Jones notes, the root for his Ptime result (for data order 
0) is an early paper of Cook [Q| on "two-way auxiliary pushdown automata". How- 
ever, no characterization of Ptime in terms of recursion with unlimited data orders is 
obtained. 

Jones work is certainly related to the present one: in his approach it is important 
whether constructors of structured data are allowed or not. Moreover, Jones' paper 
is written from a broader programming point of view, giving special emphasis to the 
program control structure (general recursion, tail recursion or primitive recursion). The 
present paper (for simplicity) concentrates on the effects of higher order primitive re- 
cursion (i.e. Godel's T). 

Leivant and Marion [ p^ give a A-calculus characterization of Ptime. The main nov- 
elty is the use of the concept of "tiers" in the sense of different representations of data: 
as words (i.e. terms of base type), or as Church-Uke abstraction terms (expressions of 
higher type). This leads to a nice characterization of Ptime. Leivant [ |l7| ] gives par- 
ticularly elegant proofs of characterizations of Jones [|l^] for Pt ime and of Beckmann 
and Weiermann [^] for the elementary functions. He also treats a control-based sub- 
calculus of the system in and — more importantly — obtains a characterization 
of P space along the same lines. This is achieved by the notions of an "input driven" 
term, i.e., a term with the property that no recursion argument has a free variable bound 
in the term, and the more special notion of a "solitary" term, where every higher type 
abstracted variable has at most one occurrence in the kernel. The solitary terms are the 
ones used for his characterization of Ptime. As with all extensional characterizations 
of Ptime, the question remains as to whether interesting Ptime algorithms can be 
represented directly. 

2 Types and terms 

Definition 2.1 (Finite types). The set of linear types is defined inductively as 

p,T 0|B|r^p|T(g)p|rxp| L(t) 



So types are built from the base type B of booleans and a special type O. L(r) denotes 
the list-type over r. The type of binary numerals, as used by Hofmann [[lO||, can be 
defined as L(B). The type O may be interpreted as a pointer to free memory [|§]; 



Lemma 4.3 will show that no closed terms of type O exist. Canonical terms of data- 
types may still contain free variables of that type. The type O — o r ^ L(t) ^ L(t) 
of the cons^ function together with the linear typing discipline guarantees that the 
length of lists and the number of free variables coincide. 

T ® p and T X p both represent ordered pairs. However, the linear meaning of those 
types is different. Each component of a tensor product t p can be used, whereas in 
the case of an ordinary pair one can choose only one component to be used. As can be 
seen from Definition p!^, (g) corresponds to while x corresponds to B. 
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Terms are built from variables, denoted by x, y, z, and typed constructor symbols c. 
Each variable has a type, and it is assumed that there are infinitely many variables of 
each type. The notation should express that the variable x has type t. 

Definition 2.2 (Terms). The set of terms is inductively defined by 

r, s, t x'^ \ c \ Xx^ . t \ {t, s) \ ts \ {t} 

These terms should be seen as our "raw syntax"; only correctly typed terms (in the 



sense of Definition 2.4) represent meaningful functions. 



The idea for the {•} term construct is taken from Joachimski and Matthes 1 12 1. Terms 
of this form appear only as arguments in an L(t) elimination. This explicit marking of 
the step term in iteration constructs is not only technically convenient but also allows 
to directly read off the time complexity: the degree of the bounding polynomial is the 
nesting depth of braces and the symbols within n braces contribute to the coefficient of 



X" (see Definition 4.5 for details) 



The notations ts and Xx'^ .t are defined as usual, so Ax"^, yP.t is an abbreviation for 
Xx"^ .{XyP . t). Terms that only differ in the naming of bound variables are identified. 

Definition 2.3 (Constructor symbols). The constructor symbols and their types are 



tt 


B 




fF 


B 




nil,- 


L(r) 




conSi- 


O - 


--L(r)-L(r) 




T ^ p- 


^3 T p 



A context is a set of variables. For two contexts Fi and F2 the notation Fi, F2 stands 
for the union, expressing that Fi and F2 are disjoint. We also write for the singleton 
context {x'^}. 

The next definition states which terms of our raw syntax are correctly typed. F h is 
to be read as "t is a typed term of type r with free variables in F". 

Definition 2.4 (F h V). The relation F h is inductively defined as follows: 



F, .T^ h x^ {Var) 
c of type T 

F h (Const) 
F U {x^} h tP 

{Xx^ty-^p (^+) 
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r h r sp 



rh{t,sy'"' (x+) 
r h t^^'p 

r K (ttt)' (xr) 
r h 



rh(tffr (X-) 



ri,r2h (t(s,r»" (B-) 



^l,^2K(^(Ax^2/^s))^ (®-) 



rh(ns}r"^'' (Mr)-) 



It is crucial that the step term in (L(t) ) is closed. Otherwise it would not be pos- 
sible to define reduction rules in such a way that typed terms reduce to typed terms 



(Lemma 3.5 ), since free variables in the step terms would be duplicated. 



The typing system is based on elimination rules: For every type different to O there is a 
rule describing how to use terms of this type; in all these rules the elimination is written 
as application|j. For the right hand side of such an application a notation is chosen 
that expresses the computational behavior of the terms used for the elimination. This 
avoids duplication of syntax, e.g., with the pair {t, s) we have a notation expressing 
that exactly one of the terms t and s will be needed. This syntax is also used for the "if 
. . . then . . . else ..." construct t {s, r). 

Data-types, i.e., types from which all the stored information can be retrieved by a single 
use of an object of this type, are introduced by constants. In the system we have two 
forms of abstraction with special introduction rules: the A-abstraction and the pair. 
Note that the pair really is an abstraction, as only part of the stored information can be 
retrieved by a single use. The tensor t ® p however is a data-type as a single use can 
access both components, hence we have a constant ®r,p to introduce it. 

' It should be noted that the same syntax would appear in the standard implementation in the untyped 
A-calculus. For example one should think of (X) as being \x, y, z.zxy. 
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This way of introduction allows the relatively simple Definition 3.3 of the reduction 
relation expressing that we may reduce within data, but not under abstractions. 

Immediately obvious from the definition is 

Proposition 2.5 (Weakening). IfT h and T cT' then V h T. 



Rule {-^^) could as well have been written 

r, h tp 
r h {Xx-^.ty 



requiring that the bound variable does not occur in the context afterwards. In our for- 
mulation it is easier to recognize weakening as a derived rule. As we identify a-equal 
terms, weakening holds in the alternative formulation as well and both formulations 
are in fact equivalent. 

It might seem odd that in this calculus a typed term can have an untypable subterm {t}. 
An obvious definition would introduce a new form of application, e.g., if r, s are terms 
then so is r {s], rather then a new term former {s}. However, the present approach is 
technically more convenient, since it allows the simple classification of terms according 



to their head form (cf. Lemma 2.8 ) 



It should be noted that with some technical overhead one could resolve this problem 
by separating terms and so-called elimination terms, as in work of Joachimski and 
Matthes [|l2|]. But in the present rather simple situation this seems to be an overkill. 

The notation expresses that there is a F such that T \- . The smallest such F is 
called the set of free variables. By induction on the definition of F h one easily 
verifies 

Lemma 2.6. For each term t there is at most one type r such that V . In this case there 
is a smallest F with T \- V which coincides with the set of free variables of t in the 
usual sense. 



As already mentioned, the main restriction of the typing calculus is linearity, that is, in 
an application tit2 the free variables of ti and t2 have to be disjoint. This is stated ex- 
plicitly in the rule (^^), but holds for all other "applications" as well. More precisely 

Lemma 2.7 (Linearity). Assume {tit2Y. Then either t2 — {^2} with closed t'2 or 
else there are types t' and t" and disjoint contexts ri,F2 such that Fi h ti"^ and 

T2 K t2^" 

Proo^ The last rule of a derivation of F h (tit2)^ must be one of (— o~), (x^), (xj~), 
(B~), ((8)^) or (L(r)~). In each of these cases the claim is trivial. For example, in the 
case we conclude by two applications of (^^) that F2 K (Ax"^, .sY^''^'^ . 

□ 

^It should be noted that this is just an inspection of the rules and in particular no induction is needed. 
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The fact that all eliminations are written as applications ensures that all typed terms 
have a uniform appearance. As can easily be verified by induction on the definition of 
the • h ■ relation, we have 

Lemma 2.8 (Head form). Assume T h t^. Then t is of the fonn xt, ct, (Ax^. s)tor 
(s, r) t. U 

It should be noted that this lemma, although technically trivial, turns out to be crucial 
for the further development: As we are taking vector notation serious and only have 
constants for the introduction of data-types, case distinction according to this lemma 
(and further according to the constant c in the case ct) is essentially case distinction 
according to the "form of the next canonical redex", without the need of defining such 
a notion. 



3 Reductions 

The reduction rules to be defined are all correct with respect to the set-theoretic se- 
mantics (cf. Hofmann [[ll], §2.1]). In order to control the effects of recursion we allow 
reduction of an iteration only if the argument is already calculated, i.e., if the argument 
is a list. 

Definition 3.1 (Lists). Lists £ (with n entries) are terms of the form 

conSrdf al{. . . (conSrd'^a^nilr)) 
where the di, . . . , dn stand for arbitrary terms of type O. 

It should be noted that we could also have required the d to be variables and get the 



same results. However, this definition allows more reductions (see Definition 3.2) and 
therefore is slightly more flexible when dealing with terms that are not almost closed, 
i.e., contain free variables of types other than O. 

Definition 3.2 (Conversions), is defined as: 

{Xx'^.t)s i~> t[s/x] 

{t, s) tt ^ t 

{t, s) fF ^ s 

tt {t, s) ^ t 

ff {t, s) 1-^ s 

<SiT,pts{\x'^ ,yP.r) ^ r[t,s/x,y] 

nilr {t} s 1^ s 

consrd^ ae{t}s td^a{£{t}s) ^ a fist 

Although they look quite similar, the rules {t, s) tt ^ t and tt {t, s) t actually have 
a very different meaning: The first rule says that we can unfold a projection once the 
argument of the term of x -type is in canonical form, whereas the other rule tells us to 
take the if-branch, once we know that the conditional evaluates to tt. Also notice the 
different typings of the two rules. 
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Definition 3.3. The reduction relation t ^ t' is inductively defined as follows: 

t^t' t^t' s ^ s' 

t t' ts t's ts ts' 

The requirement that we can only unfold one step of an iteration if the argument is 



a full list (in the sense of Definition 3.1) is crucial for our method of estimating the 



length of reduction sequences (cf. the introduction to Section m. This can be seen 



from the corresponding case in the proof of Theorem 4.8 Apart from this restriction 
and the requirement that one cannot reduce under abstractions the reduction strategy is 
arbitrary. 

As usual, we call a term t normal if it cannot be further reduced, i.e., if there is no t' 
such that t ^ t' . 

Lemma 3.4. T/Ti U {x''} h i"^ and V2 h s'' and moreover Ti and T2 are disjoint, 

thenTi,T2 h (t[s/xP]Y. 



Proof. Induction on t, using Lemma 2.7: If t is a variable or a constant the claim is 



obvious. If t is of the form t — tit2 then by Lemma 2.7 either t[s/a;] = ti[s/x]t2 or 
t[s/a;] — ti{t2[s/x]). We apply the induction hypothesis to the corresponding subterm 
and can type t[s/a:] by the same rule used to type t. If t is of the form t = Xy.r, then t 
must be typed due to (—0^), hence Fi U {x, y} h r'^ and we may apply the induction 
hypotheses to r (without loss of generality y does not to occur in and then conclude 
the claim by (^^). Similar if < = {ti,t2). □ 

Lemma 3.5 (Subject reduction). IfV h and t t', then T h t''^. 

Proof. Induction ont ^ t' shows that only conversions have to be considered. The 



only non-trivial case is handled in Lemma 3.4 . □ 



Next we show that we have sufficiently many reduction rules, i.e., that normal terms 
have the expected form. As the size of a term is its number of free variables, we also 
have to consider non-closed terms. As O is the only base type without closed terms, 
we can restrict ourselves to almost closed terms, i.e., to terms with free variables of 
type O only. It should be noted that (for example) an arbitrary list can be encoded as 
an almost closed term. 

Proposition 3.6. Every normal, almost closed term of type L(t), B or O is a list, tt or 
ff, or a variable of type O, respectively. 



Proof. Induction on t and case distinction according to Lemma 2.8: If t is of the form 
xt then X has to be of type O and hence t has to be empty (since there is no elimination 
rule for the type O). If t is of the form {\x.s)t or (s, r) t, then t has to be empty as 
well, for otherwise the term would not be normal. So the interesting case is if t is of the 
form ct. In this case we distinguish cases according to c. If c is tt or fF, nil^ or ®r,p, 
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then t can only consist of at most 0, 1 or 2 terms, respectively (for otherwise there were 
a redex); hence we have the claim (or the term is not of one of the types we consider). 
In the case conSrdart we apply the induction hypothesis to r yielding that t is of the 
form £t with a list £. So if t would consist of more then one term there would be a 
redex; but if t is a single term, then the whole term would have arrow-type. □ 

It should be noted that in the above proof the only place where the induction hypothesis 
actually is needed is to argue that in conSrd^ ar the subterm r has to be a list as well 
(and hence the whole term is a list). In particular, these statements need not all be 
proved simultaneously, but only simultaneously with the statement that every normal, 
almost closed term of list-type is in fact a list (which is necessary due to the side- 
condition on the reduction rules for lists). This modularity is a feature of the vector 
notation, i.e., the consistent use of elimination-rM/e.s written as applications: in this way 



the (syntactical) Lemma 2.8 gives easy access to the "canonical" redex by showing the 
corresponding introduction. It should be noted that the use of elimination-con.sfanfi 
("iterators") would have messed up this modularity, as in the case of an elimination 
constant followed by some arguments one would in fact need some form of induction 
hypothesis stating that the argument to be eliminated is introduced canonically. 



4 Length of reduction chains 

Before continuing we give a sketch of the main idea of the proof in order to motivate 
the following definitions. Since the system is linear, /3-reduction reduces the num- 
ber of symbols (since the A disappears). The same is true for the reductions due 
to projections and the "if . . . then . . . else". So the only case where the number 
of symbols is increased is the case of iteration. However, if we unfold an iteration 
completely, i.e., if we reduce consT-df r[(. . . {conSrdf^r^^nilr)) {h} t inn + 1 steps 
to hdfrl{. . . (hdf^rj^t)) then the {-j's disappear! So by making them "sufficiently 
heavy" the total weight of the term reduces. As however the weight necessary for the 
{•}-construct depends not only on the term within the braces, but also on the length n 
of the numeral we iterate on, we cannot assign this term a fixed number; instead we 



assign a polynomial(ly bound function) in Definition |4-.5| But what to plug in? Noting 
that we have n terms df,... ,d^ of type O and remembering the idea that terms of 



type O should contain free variables (as will be proved in Proposition 4.3) we find that 
the number of free variables is just the right upper bound for the length n (since the d 
are applied to one another, their free variables have to be disjoint). 

This would lead to a proof that every reduction strategy that always unfolds iterations 
completely is polynomially bounded. In order to get results for every reduction strat- 
egy we notice that within the subterm conSi-df r[(. . . (consT-d^r^nil,-)) {h}t the 
iteration unfolds at most n times, even if the actual number of free variables in the 
whole term is largerfl Using this information we can limit the assigned polynomial(ly 



bounded function) to get a measure that actually decreases in every step (Theorem 4.8 ). 



' This subterm might not unfold at all if it is positioned under a A -abs traction that remains in the normal 
form. We do not allow to reduce under A-abstractions (see Definition 3.31) since the number of free variables 
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So we use three different measures for terms: the number of free variables, which 
corresponds to the "size function" in Hofmann's work [|lO||; the length, which is the 
number of symbols of a term that can be accessed (using the interpretation that you can 
only access one component of an ordinary pair) and the polynomial which is the upper 
bound for the complexity of a function. 

Definition 4.1 (Length). The length \t\ of a term t is inductively defined as follows: 



|c| := |. 
\ts\ 

\Xx^.s\ 

\{t,s)\ 

m\ 



= 1 

-\t\ + \s\ 
= |s| + l 

= max{|i| , |s|} + 1 
= 



As the length \t\is essentially used to handle /3-redexes, the length of {-j-terms (which 
are closed terms!) is of no importance. So for simplicity, and to obtain slightly sharper 
results (as the length occurs in the Definition of the polynomial bound) the value 
has been chosen. It should be noted that all the results would also hold with any other 
"reasonable" definition for \{t}\, such as \t\. 

Lemma 4.2. IfTi h andV2 h and moreover Ti, T2 are disjoint, then \t[s/x\ \ < 
\t\ + \s\. In particular, \{Xx'^.t)s\ > \t[s/x]\. 



Proof. Induction on t, using the fact that t is typed and therefore in the case of an 
application only one of the terms can contain the variable x free (compare Lemma 2.7). 

For instance, if t is tit2, then the last rule of a derivation of F h ti,t2^ must be one 
of (^~), (xq^), (xf)' )' (® ) or In each of these cases the claim is 

obvious. □ 



Lemma 4.3. Every term of type O contains a free variable, i.e., ifV h , then F 7^ 



Proof. Induction on |i| and case distinction according to Lemma 2^. The case xt is 
trivial. In the case {\x.s)t the t cannot be empty (for otherwise the term would have an 
arrow type), so we can apply the induction hypothesis to the (by Lemma shorter 
term t[ti/x]t2 ■ ■ ■ tn- Similarly for (r, s) t, where we apply the induction hypothesis to 
rt2 ... tn 01 st2 .■■ tn depending whether ti is tt or ff (note that by typing one of these 
has to be the case) and ®r,pt, where we again use Lemma \.2. For conSrt we use 
the induction hypothesis for ti. For nil^t use the induction hypothesis for ^2 • • ■ tn . In 
the cases ttt and fft, by typing, ti has to be of the form (r, s) and we can apply the 
induction hypothesis to (say) rt2 . ■ - tn- □ 

under a A-abstracdon is potentially higher and therefore numerals and lists are potentially longer. This 
restriction corresponds to not allowing the use of a "potential resource", i.e., a resource that is not yet present. 
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The following corollary is formulated for type 1 functions only. However, if one ac- 
cepts the number of free variables as a reasonable size measure for definable functions 
for higher types as well, then the corollary trivially remains valid for every definable 
function. 

Corollary 4.4 (Non-size-increasing property). Every function of type L(t) -o L(t) 
definable by a closed term has the property that the output is not longer than the input. 



Proof. Lemma shows that for closed terms of type L(t) the usual length and the 
number of free variables coincide (due to the typing O ^ r ^ L(t) ^ L(r) of 
the cohSt- function). The number of free variables trivially does not increase when 
reducing the term to normal form. □ 

Let N be the set of natural numbers and W°^^ be the set of all functions from N to 
N that are (pointwise) bounded by some polynomial. We write X for the identity on 
the natural numbers, f + g and / • g for the pointwise sum and product in and 
Xn for the minimum of the identity and n, i.e., X„(to) — min {n, m}. N is treated 
as a subset of NP°'y by identifying a natural number with the corresponding constant 
function. Let =^ be the pointwise order on Then there exist finite suprema (the 

pointwise maxima) in f^P°^y with respect to ^, denoted by sup^ {•}. 

It seems that by considering functions we leave the realm of syntax. But as we will 
only use functions that are explicitly defined by pointwise sum, product, maximum 
and minimum from the identity and constants we could as well restrict ourselves to 
these functions only. It should be obvious how these functions can be encoded by 
some finite (syntactical) object. 

Definition 4.5 (Polynomial bound {■) of a term). For each (not necessarily typed) 
term t we define a polynomial (t) G by recursion on t. 

i^{x) := ?9(c) 

■d (t) + Xn ■ i!) (h) + Xn ■ \h\ if i is a Hst with n entries 
^ (fs'j ) and s is of the form {h} 

d (t) + d (s) otherwise 
■d(Xx''.t) -.^^{t) 

^{{h}) -.^X ■'d{h) + X -Ihl 

n 

We write (t) for Y^'d{ti). 

i=l 

Immediately from the definition (since Xn =^ X) we get 

Proposition 4.6. t? (ts) ^■d{t)+d (s). □ 
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Lemma 4.7. IfTi h r2 h and ri,T2 are disjoint, then § {t[s/x\) 4 (t) + 

I? (4 



Proof. We prove this by induction on t. The only non-trivial case is when t is an 
appUcation (note that the case {h} does not occur since t is typed). So let t be of the 
form rr'. We distinguish cases whether r is a list or not. 

If r is a list with n entries then, as t is typed, we know that r' must be of the form 
r' = {h} and therefore closed. r[s/a::] is easiliy seen to be again a list with n entries. 
Therefore we get 

■d ({rr')[s/x]) — d {r[s/x]r') 

^^{r[s/x])+Xn-^ih)+Xn-\h\ 
4d{r)+^{s)+Xn-^{h)+Xn-\h\ bylH 
= z9 {rr') + z9 (s) . 

If r is not a list, then from the fact that t is typed we know that at most one of the terms 
r and r' contains the variable x free. In the case x G FV (r) (the other case is handled 
similarly) we have 

"i? {(rr')[s/x]) — '& {r[s/x]r') 



=4 {r[s / x]) + d {r') by Proposition 4.6 

4^ir)+i9{s) + i} (/) by IH 
= ^{rr')+^{s) . 



This completes the proof. 



Theorem 4.8. Assume T ^ V and N > |FV {t)\ where iV e N. Ift -> t', then 

d{t) {N)^\t\>d{t') {N) + \t'\ 
In particular, any reduction sequence starting from t has length at most 

^W(|FV(t)|) + |i| 

Proof. We prove this by induction on the definition of the relation t ^ t'. 

Case rs ^ r's via r ^ r' . We distinguish whether r is a list or not. 

Subcase r is a list with n entries. Then by typing restrictions and from Lemma 4.3 we 



know that n < |FV {t)\ < N. Also s has to be of the form s = {h}. r' is again a list 
with n entries, so 

^irs){N) + \rs\ 

= ^lr){N)+n-^{h){N)+n-\h\ + \r\ + \s\ 

> i}{r'){N) + n-i}{h){N)+n-\h\ + \r'\ + \s\ by IH 

= i?(r's)(iV) + \r's\ 
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Subcase r is not a list. Then 



d{rs){N) + \rs\ 

= d (r) (N) + -d (s) (N) + \r\ + \s\ 

> ^ {r') (N) + i) (s) {N) + \r'\ + \s\ by IH 

> 'd{r's){N) + \r's\ by Proposition ^.6 



s'. Then r is not a list, since otherwise, s were of the form 
{•} and we must not reduce within braces. Therefore this case can be handled as the 
second subcase above. 

Case t ^ t'. We distinguish subcases according to the form of the conversion. 

Subcase (Xx. t)s ^ t[s/x]. We have 

^i{Xx.t)s){N) + \{Xx.t)s\ 

= ^ (t) (N) + ^ (s) (N) + \t\ + l + \s\ 



> ■d{t[s/x]){N) + + l + by Lemma 

Subcase ®T-_prs(Ax, y.t) t[r, s/x, y]] is handled similarly. 

Subcase conSrd^ a£ {s} r ^ sd^ a{£ {s} r)] with £ a list with n entries. Then by 



Lemma 4.3 and the linear typing discipline n + 1 < |FV (consrd {s} r)\ < N. 



Therefore we have 



{){conSrd^ ae{s}r) {N) 
+ \conSrd^a£{s}r\ 

■d (d) (N) + d (a) {N) + i9 {£) (N) 
+ {n + 1)§ (s) (iV) + (n + 1) \s\ + ^ (r) (N) 
+ l + |d| + |a| + |^| + |r| 



and on the other hand 



^{sd'^a{£{s}r)) (N) 
+ \sd'>a{£{s}r)\ 

d (s) (N) + I? (d) (N) + d (a) (N) + i9 {£) {N) 
+ n-^{s){N) +n- \s\ + d (r) {N) 
+ |s| + |d| + |a| + |£| + |r| 

which obviously is strictly smaller. 
Subcase tt (r, s) t—i- r. We have 

i9(tt (r,s))(iV) + |tt(r,s)| 

= sup^{i?(r),i?(s)}(Af) + l + max{|r|,|s|} 
> ^{r){N) + l + |r| 

The remaining subcases are similar to the last one. 
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Corollary 4.9 (Hofmann, 1999). 7/0 h tL(B)^ •••^l(b)^l(b)^ then t denotes a Junc- 
tion computable in polynomial many steps. 

Proof. We only treat unary functions. Let £ be an almost closed normal list. Then 
•& (ti) = 1? {t), as 1? {£) = 0. Therefore an upper bound for the number of steps is 
1? (i) (|FV (^) I) + |i| + \i\, which is polynomial in the length of the input i. □ 

Strictly speaking, the above result does not show that functions definable by closed 
terms are polynomial time computable, since no machine model has been provided. 
However, it is rather obvious how these reductions can be implemented in constant 
time (with respect to the size of the input): First note that the only terms that have to 
be duplicated are the step terms of iterations which are not part of the input and — as 
they are closed terms — no part of the input can become substituted into them. Next 
remark that all /3-redexes are linear and hence can be implemented by the usual pointer 
switching techniques. In the case of (•, •) an indirection node is used for the shared 
variables of the two components. Our reduction strategy does not reduce within a pair 
until the projection or the "if . . . then . . . else"-statement is resolved. As pairs are not 
part of the input, the amount of extra work in each resolution step is independent of the 
input. 

Example 4.10 (Insertion sort). Let r be a type equipped with a linear ordering. As- 
sume that this ordering is represented by the term <x of type r -o T —o B®(r®r), i.e., 
<irts — »■* (8)B,r®Ttt(®T,Tis), if t is "smaller" than s, and <irts ^B.r^rff (<H)T,ris) 
otherwise. 

Using this function, we can define a sort function of type t ^ t ^ t (E) t for two 
elements, i.e., a function of two arguments that returns them in the correct order: 

Now we can immediately define a function of type L(t) ^ O ^ t ^ L(t) inserting 
an element at the correct position in a given sorted list. At each step we compare the 
given element with the first element of the list, put the smaller element at the begiiming 
of the list and insert the larger one in the rest of the list. 

insert = Xl.l{Xxf ,yl,p'^^'^^^^'^\x2 ,1)2- <lr yiyii^zl , Z2 consTXizi{px2Z2))} 
(Aa;*, y"^. cons^ajt/nil^) 

Then insertion sort is defined as usual: 

sort = AZ^ . Z I Aa;* , t/^ , /^^^^ insertZxy | nil^ 

Counting braces, as 1? (<!j.) = 1? (<t)> we get 

1? (sort) = X • 1? (insert) + 0{X) = • 1? (<;) + 0{X^) = • 1? (<^) + O(X^) 

This reflects the fact that insertion sort is quadratic in the number of comparison oper- 
ations. 



14 



By simple modifications of (the proof of) [ p^ §4.3] we may conclude that many "nat- 
ural orderings", e.g., the normal ordering on binary coded natural numbers, can be 
defined in the given term system. 

However, it should be noted that it is not necessary that every interesting ordering is 
definable in the given system. It would also be possible to just add a new symbol 
<Jt with the conversion rules ^^.^s i-^ '8'B.r®Ttt(®r,T^s), if t is "smaller" than s, 
and O^-is i-^ (S)-Q .^^^f{[(^^ .^ts) otherwise. With 4 the above theory remains 

valid and shows that there are at most of the newly introduced conversions in a 
normalizing sequence. Therefore this theory can be used to calculate the number of 
calls to a "subroutine", even if the subroutine itself is not definable in the given system, 
or not even polynomial time computable. 

5 Extensions of the System 

The syntactical analysis of the system allows various extensions which we only sketch 
here, giving sufficient detail to reconstruct the proofs. 

5.1 Full Polynomial Time 

The system so far only contains non-size-increasing functions, and hence cannot con- 
tain all Ptime functions. New results of Hofmann [|[] show that indeed all these 
functions (and hence in particular all Pt ime-predicates) are already within the present 
system. 

Here we shall briefly sketch an approach to obtain all Pt ime functions, that might give 
some insight into the way the restriction to non-size-increasing functions works. Its 
motivation was to avoid explosion of growth by iterating over already aggregated data- 
structures. Yet in the definition of Ptime, the only large data-structure of a Turing 
machine is its tape. Moreover, a Turing machine does not iterate over its tape but 
instead modifies it locally. 

The central idea lies in the observation that size is represented by the number of free 
variables. Hence, we can add a type l that allows closed terms for objects that are se- 
mantically of arbitrary size. On this type, we can then define functions that are seman- 
tically size-increasing, like the extension of the Turing-tape, but are from a technical 
point of view non-size-increasing, in that they do not require an argument of type O. 

Iteration on this new type l would lead beyond polynomial time, as the number of 
iterations that a loop (i.e., an L(t) -elimination) unfolds to is no longer immediately 
related to the input (via the number of free variables). 

The term system is extended by adding a new ground type t, adding the following 
constants with their respective types 

O L p L —o i 

50 L ^ L iszero t ^ (B ® t) 

51 L i head t ^ (B ® t) 
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I— > is enriched by 



po 



iszeroo 
iszero(Sii' ) 



5B,i.tto 

5B,.ff(s»t') 



heado 

head(sot'^) 
head(sit'^) 



®B,.ff(Sot') 
®B,.tt(Sl<') 



The definitions of the relations F h and t ^ t' remain unchanged. Proposition 2.5 



and Lemmata 2.6, 2.8 , 3.4 and 3.5 remain valid with almost identical proofs. 



We call terms of the form Si-^{. . . (si^o)) short numerals. As in Proposition 3.6 we 
show that every normal, almost closed term of type i is a short numeral (and therefore 
closed). 

The definition of the term length \t\ had the property that for all reductions except 
iteration the length decreased. To retain this property we define |iszero| = |head| = 3 
and keep the rest of Definition i.l. In particular, every constant different to iszero and 
head still has length 1. Then Lemmata 4.2 and 4.3 remain valid. 



The definition of d (t) remains unchanged. Then (with identical proof) Lemma f4.7| 
holds and also the main Theorem 4.8. In particular, the extended system still consists 
of P time functions only. 

To show that every Pt ime function can be defined by a closed term of type N ^ t, we 
code the configuration of a Turing machine (with N states {So, . . . , Sn^i}, working 
over the alphabet {0, 1}) with the symbols iq . . . «fc before and including the head and 
the symbols jo . . . jk' followed by the non-visited positions after the head and with 
current state Sm by 

«)(5,,(...(5,„o))) 
(®(Sjo(---(%-°))) 

(®ti((8)...(®i„_it„)))) 

with n being the smallest number such that N < 2" and each of the t being tt or fF 
such that this is the binary coding of m . The closed terms So and Si extend the Turing 
tape where necessary, so we can define the one-step function of the Turing machine by 
sufficiently many case-distinctions. Iterating this one-step function polynomially many 
times (e.g., as shown by Hofmann [|o[ §4.3]) completes the (sketched) proof. 



5.2 Trees 

We sketch how our technique applies to the more complex data type of binary labeled 
trees; this is to be hoped for, as Hofmann's original method is capable of dealing with 
them [ pi] , §4.3]. Notice that the extension is not completely obvious, since iteration 
on trees involves two recursive calls. It turns out that the number of free variables in a 
term still is a good measure for the number of unfoldings of an iteration. 

The system is extended by 
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• a new type constructor T(t, p) for trees (with nodes labeled with elements of 
type T and leaves labeled with elements of type p) 

• new constants with their respective types 

leaf^,p /9^T(t,p) 

tree^,p O ^ r ^ T(r, p) ^ T(r, p) -o T(t, p) 

• a new term constructor { • , • } 

• a new typing rule 

tWMT (t(t,p)-) 



We inductively define the notion of a tree (with n nodes) by 

• \eair,ptP is a tree (with nodes) 

• if ti and t2 are trees (with ni and n2 nodes) then treer.pd^ a^tit2 is also a tree 
(with ni + 712 nodes) 

The conversion relation is augmented by 
\ea.{r,pt {s, r] f-^ 

tveer,pdatit2 {s, r} i— > sda{ti {s, r}){t2 {s, r}) ti, t2 trees 

We extend the definition of the length by 

|{s,r}| := \r\ 
and the definition of the polynomial bound by 

1? ({s, r}) := X ■ I? (s) + (X + 1) • i9 (r) + X • |s| + X • \r\ 

-d (t) + Xn-d (h) + Xn \h\ if t is a list with n entries 

and s is of the form {h} 

i9 (t) + X„ • z9 (s') + (Xn + 1) • i9 (r') 

+X„ ■ |s'| + Xn ■ \r'\ if < is a tree with n nodes 

and s is of the form {s' ,r'} 

■d (t) + •& (s) otherwise 



Then the theory in section ^ remains valid, with identical proofs. The new non-trivial 
subcase in Theorem 4.8 is 



tree-r, pdatit2 {s,r} i— > sda(ti {s, r}){t2 {s, r}) 
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where ti and <2 are trees with ni and 712 nodes respectively. Then by Lemma p3| and 
the linear typing discipline we know that rii +71,2 + 1 < \FY {treer.pdatit2 {s,r})\ < 
N, and hence 

1? {treer ^pdatit2 {s, r}) (N) 
+ \treer.pdat1t2 {s,r}\ 

^ (d) {N) + ^ (a) (N) + ^ (h) {N) + ?9 (ia) {N)+ 
+ {m +n2 + (s) (N) + (m + 712 + 1 + (r) (N) 
+ (ni + n2 + 1) \s\ + {ni + n2 + l) \r\ 
+ l + |d| + |a| + |ti| + |i2| + |r-| 

and 

^i.sda{h{s,r}){t2{s,r})){N) 
+ \sda{ti{s,r}){t2{s,r})\ 

d (s) (iV) + (d) (iV) + (a) (iV) 
+ I? (ti) (N) + m • ?9 (s) (A^) + (ni + 1) • I? (r) (TV) + m • |s| + m • |r| 

+ 13 (<2) (iV) + 772 • (s) (AT) + (772 + 1) • I? (0 (iV) + 772 ' |s| + ^2 • |r| 

+ |s| + |d| + |a| + |ii| + |r| + |i2| + |r| 
which is strictly smaller. 

It should be noted that, once iteration on labeled trees is available, "divide and conquer" 
recursion can be implemented: Starting from a "divide function" 

f : r ^ L(t) ^ t L(t) ® L(t) 

we can construct a function f : L(r) -o T(t, L(r)) with 

|(nil^) = leaf^L(r)nilr 

}{conSrd^ a'^ e) = treeT- da' ii£ 2 if fa£ ^ a' ig) £1 ® £2 

To do so we first construct via iteration over lists a function g: L(r) ^{t) x 
T(t, L(t)) with g{£) = (£,]£^- Iterating a function that appHes f to (the label of) 
every leaf of a tree sufficiently (i.e., linearly) often [|lO[ §4.3] we can define a function 
P : L(r) T(t, L(t)) such that 

P(ml^) = leaf^.L(r)(nilr) 

P(conSd*a^^) = tree^,L(r)rfa'(f'°^i)(P^2) if fa£ a' ® ^1 £2 

A final iteration over the created tree with an appropriate "conquer function" and an 
appropriate initial value finishes the implementation of "divide and conquer" recursion 
in the present system. 

Following these lines, we can, for example, specify quicksort by providing 

• a divide- function of type r — o L(t) — « t L(t) ® L(t) splitting a list into two 
sublists; one with the elements larger than the given one and one with smaller 
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elements 



Aa;^/^("^ /{Ay?,yr,p"®''^"'®" p(Aa;^p^(^)®^M p{Xl',l" 

Z{ ®r,L(r)«.L(r)a;('8)L(T),L(r)(cOnSryo2/l?')^")7 

®r,L(T)«.L(r)a;((8)L(T),L(r)?'(cOnSxyoyi?"))))))} 

<8'T,L(T)®L(T)a;((8)L(r),L(r)nilTnilT) 

• a conquer function of type O — o r — o L(t) — o L(r) — o L(r) taking two sorted 
lists and joining them in the correct way with the middle element 

where app is the append function, defined as usual 

• an initial case for the empty Ust, nil^ 

However, as in this implementation of "divide and conquer" recursion for each unfold- 
ing step the whole (intermediate) tree structure has to be traversed, it is less efficient 
than a "native" variant of "divide and conquer", but, as shown, still in polynomial 
time. 
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